
[Fix] in Xnnpack EP, the conversion for fused activation param isn't correct #23115

Draft
wants to merge 7 commits into base: main

Conversation

mszhanyi
Contributor

Description

In the Xnnpack EP, the conversion of activation_param is incorrect for FP16 models.
This can sometimes cause an exception: "lower bound must be below upper bound".
Because the CPU EP doesn't currently support FP16 activation fusion, the newly added test skips the comparison of the test results.

Motivation and Context

@mszhanyi mszhanyi marked this pull request as draft December 16, 2024 13:06
@mszhanyi mszhanyi marked this pull request as ready for review December 17, 2024 02:13
? *reinterpret_cast<const float*>(value.raw_data().data())
: value.float_data()[0];
int32_t arg_type;
if (GetType(arg, arg_type) && arg_type == ONNX_NAMESPACE::TensorProto_DataType_FLOAT16) {
Contributor
What if GetType(arg, arg_type) failed here?

Contributor

Generally type info is always available, so I think this is ok. Shape info may be missing depending on the model.

The Conv op looks to be set up to allow fp32, u8, s8 and optionally fp16. Should this also handle u8 and s8, or should ClipReluChecker limit fusion to fp32 and fp16?

Comment on lines +134 to +135
// So far, CPU EP doensn't support Fp16 Conv fusion, so verify_outputs is skipped.
RunAndVerifyOutputsWithEP(ort_model_path, "TestNhwcConvReluClipFusion_FP16", std::move(ep), feeds, params, {}, false);
Contributor

Not quite following. There should still be valid output from the CPU EP even if it doesn't fuse, so why can't we use verify_outputs?

Suggested change
// So far, CPU EP doensn't support Fp16 Conv fusion, so verify_outputs is skipped.
RunAndVerifyOutputsWithEP(ort_model_path, "TestNhwcConvReluClipFusion_FP16", std::move(ep), feeds, params, {}, false);
// So far, CPU EP doesn't support Fp16 Conv fusion, so verify_outputs is skipped.
RunAndVerifyOutputsWithEP(ort_model_path, "TestNhwcConvReluClipFusion_FP16", std::move(ep), feeds, params, {}, false);

@mszhanyi mszhanyi marked this pull request as draft December 20, 2024 02:53